Bias-corrected paralinear and LogDet distances and tests of molecular clocks and phylogenies under nonstationary nucleotide frequencies.

نویسندگان

  • X Gu
  • W H Li
چکیده

The statistical properties of the paralinear and LogDet distances under nonstationary nucleotide frequencies were studied. First, we developed formulas for correcting the estimation biases of the paralinear and LogDet distances, i.e., the bias-corrected distance is estimated by dc = d - 2var(d), where d and var(d) are the estimated distance and sampling variance, respectively. The performances of these formulas and the formulas for sampling variances were examined by computer simulation. Second, we developed a method for estimating the variance-covariance matrix of paralinear distances, so that statistical tests of DNA phylogenies can be conducted in the nonstationary case. Third, a new LogDet-based method for testing the molecular clock hypothesis was developed under nonstationary nucleotide frequencies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance of the relative-rate test under nonstationary models of nucleotide substitution.

Relative-rate tests have previously been developed to compare the substitution rates of two sequences or two groups of sequences. These tests usually assume that the process of nucleotide substitution is stationary and the same for all lineages, i.e., uniform. In this study, we conducted simulations to assess the performance of the relative-rate tests when the molecular-clock (MC) hypothesis is...

متن کامل

Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitution.

Estimation of evolutionary distances has always been a major issue in the study of molecular evolution because evolutionary distances are required for estimating the rate of evolution in a gene, the divergence dates between genes or organisms, and the relationships among genes or organisms. Other closely related issues are the estimation of the pattern of nucleotide substitution, the estimation...

متن کامل

Genetic Distance for a General Non-Stationary Markov Substitution Process

The genetic distance between biological sequences is a fundamental quantity in molecular evolution. It pertains to questions of rates of evolution, existence of a molecular clock, and phylogenetic inference. Under the class of continuous-time substitution models, the distance is commonly defined as the expected number of substitutions at any site in the sequence. We eschew the almost ubiquitous...

متن کامل

Novel distances for dollo data.

We investigate distances on binary (presence/absence) data in the context of a Dollo process, where a trait can only arise once on a phylogenetic tree but may be lost many times. We introduce a novel distance, the Additive Dollo Distance (ADD), that applies to data generated under a Dollo model and show that it has some useful theoretical properties including an intriguing link to the LogDet/pa...

متن کامل

Nonstationary evolution and compositional heterogeneity in beetle mitochondrial phylogenomics.

Many published phylogenies are based on methods that assume equal nucleotide composition among taxa. Studies have shown, however, that this assumption is often not accurate, particularly in divergent lineages. Nonstationary sequence evolution, when taxa in different lineages evolve in different ways, can lead to unequal nucleotide composition. This can cause inference methods to fail and phylog...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Molecular biology and evolution

دوره 13 10  شماره 

صفحات  -

تاریخ انتشار 1996